Overview

Dataset statistics

Number of variables: 22
Number of observations: 36283
Missing cells: 189199
Missing cells (%): 23.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.2 MiB
Average record size in memory: 179.2 B
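
The derived figures in this table can be checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of observations; both assumptions appear consistent with the reported values, but the total size is rounded in the report, so the last result is approximate.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 22
n_observations = 36283
missing_cells = 189199
total_size_mib = 6.2  # rounded in the report, so the result is approximate

# Assumption: percentages are taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes, averaged over all rows.
avg_record_bytes = total_size_mib * 1024**2 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```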

Variable types

NUM: 15
CAT: 7

Reproduction

Analysis started: 2020-08-14 06:57:55.414171
Analysis finished: 2020-08-14 06:58:45.629136
Duration: 50.21 seconds
Software version: pandas-profiling v2.9.0rc1
Download configuration: config.yaml
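
As a quick check, the reported duration follows from the two timestamps. This is a small sketch using only the standard library; the format string is an assumption about how the timestamps are laid out.

```python
from datetime import datetime

# Assumed timestamp layout: date, time, microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2020-08-14 06:57:55.414171", FMT)
finished = datetime.strptime("2020-08-14 06:58:45.629136", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```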

Warnings

name has a high cardinality: 35140 distinct values [High cardinality]
neighbourhood has a high cardinality: 61 distinct values [High cardinality]
zipcode has a high cardinality: 695 distinct values [High cardinality]
amenities has a high cardinality: 28222 distinct values [High cardinality]
monthly_price is highly correlated with weekly_price [High correlation]
weekly_price is highly correlated with monthly_price [High correlation]
neighbourhood has 13370 (36.8%) missing values [Missing]
zipcode has 20158 (55.6%) missing values [Missing]
beds has 380 (1.0%) missing values [Missing]
square_feet has 36269 (> 99.9%) missing values [Missing]
weekly_price has 35975 (99.2%) missing values [Missing]
monthly_price has 35967 (99.1%) missing values [Missing]
security_deposit has 23793 (65.6%) missing values [Missing]
cleaning_fee has 23123 (63.7%) missing values [Missing]
price is highly skewed (γ1 = 21.9409) [Skewed]
cleaning_fee is highly skewed (γ1 = 20.1208) [Skewed]
minimum_nights is highly skewed (γ1 = 22.4918) [Skewed]
maximum_nights is highly skewed (γ1 = 190.481) [Skewed]
name is uniformly distributed [Uniform]
id has unique values [Unique]
beds has 630 (1.7%) zeros [Zeros]
security_deposit has 4281 (11.8%) zeros [Zeros]
cleaning_fee has 4361 (12.0%) zeros [Zeros]
extra_people has 29249 (80.6%) zeros [Zeros]
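
The skewness coefficient γ1 quoted in the Skewed warnings measures how asymmetric a distribution is. The following is a minimal sketch of the moment-based definition (third central moment divided by the 1.5 power of the second). The estimator used by the report may differ slightly, since pandas, for instance, applies a small-sample bias adjustment. The sample lists are hypothetical and not taken from the dataset.

```python
def skewness(values):
    """Moment-based skewness: m3 / m2**1.5 (no bias adjustment)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical samples: one symmetric, one with a single large outlier.
symmetric = [1, 2, 3, 4, 5]
long_tail = [1, 1, 1, 1, 1, 1, 1, 1, 1, 100]

print(skewness(symmetric))  # zero for a symmetric sample
print(skewness(long_tail))  # positive: the tail extends to the right
```

A single extreme value, such as the maximum_nights entry of 1e+09, is enough to push γ1 very high, which is one reason to inspect the extreme-value tables before trusting the mean.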

Variables

id
Real number (ℝ≥0)

UNIQUE

Distinct count36283
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.27986e+07
Minimum44054
Maximum4.3837e+07
Zeros0
Zeros (%)0.0%
Memory size283.5 KiB

Quantile statistics

Minimum44054
5-th percentile1.64859e+07
Q12.81454e+07
median3.48567e+07
Q33.90941e+07
95-th percentile4.16701e+07
Maximum4.3837e+07
Range4.37929e+07
Interquartile range (IQR)1.09487e+07

Descriptive statistics

Standard deviation8.14257e+06
Coefficient of variation (CV)0.24826
Kurtosis0.825724
Mean3.27986e+07
Median Absolute Deviation (MAD)4.97636e+06
Skewness-1.09262
Sum1.19003e+12
Variance6.63015e+13
Monotonicity: Strictly increasing
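
Several of the descriptive statistics are simple functions of the others. The sketch below recomputes them for id from the rounded values shown in the report, so the results agree only to the displayed precision. The relationships used (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, range = maximum − minimum) are the standard definitions and are assumed to match what the report computes.

```python
# Rounded values copied from the id statistics.
mean = 3.27986e07
std = 8.14257e06
q1, q3 = 2.81454e07, 3.90941e07
minimum, maximum = 44054, 4.3837e07

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV       = {cv:.5f}")
print(f"Variance = {variance:.5e}")
print(f"IQR      = {iqr:.5e}")
print(f"Range    = {value_range:.5e}")
```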
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
3.80846e+071< 0.1%
 
2.97546e+071< 0.1%
 
3.43805e+071< 0.1%
 
4.03612e+071< 0.1%
 
3.97919e+071< 0.1%
 
3.45531e+071< 0.1%
 
3.67793e+071< 0.1%
 
3.80142e+071< 0.1%
 
3.57246e+071< 0.1%
 
1.99161e+071< 0.1%
 
Other values (36273)36273> 99.9%
 
ValueCountFrequency (%) 
440541< 0.1%
 
1002131< 0.1%
 
1143841< 0.1%
 
1144651< 0.1%
 
1284961< 0.1%
 
ValueCountFrequency (%) 
4.3837e+071< 0.1%
 
4.38368e+071< 0.1%
 
4.38367e+071< 0.1%
 
4.38366e+071< 0.1%
 
4.38365e+071< 0.1%
 

name
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count35140
Unique (%)96.9%
Missing1
Missing (%)< 0.1%
Memory size283.5 KiB
ValueCountFrequency (%) 
嘉度resort丨近慕田峪长城、红螺寺、圣泉寺、响水湖丨房屋空间开阔,是家庭出游及公司团建不二之选330.1%
 
中关村颐和园北京大学新东方地铁直达220.1%
 
豪华特大号床间,整套45㎡。首都机场T3、新国展、黑骑士、松美术馆、国家会计学院。免费接送机、早餐。18< 0.1%
 
心归宿瀚洋店-【林间小屋】独栋木屋大床房、距市区11公里、密云水库、白龙潭、平谷桃花海、垂钓17< 0.1%
 
温馨小屋17< 0.1%
 
三居可随时长租1.2万◆北海北地铁口胡同大平房,北大医院、恭王府、后海、北海、南锣鼓巷、簋街、三里屯15< 0.1%
 
朝阳,四惠,高碑店,百子湾,传媒大学,第二外国语学院,东五环,超级蜂巢,房间清新整洁舒适大床房。14< 0.1%
 
中关村新东方清华大学北京大学颐和园人民大学中湾国际11< 0.1%
 
延庆绿茵溪谷古堡 双人标间 紧邻神庙峰 双龙潭 绿荫山泉 欢迎品尝特色农家菜10< 0.1%
 
距离北京西站15分钟的幸福村民宿10< 0.1%
 
Other values (35130)3611599.5%
 

Length

Max length127
Median length27
Mean length28.3088
Min length1

neighbourhood
Categorical

HIGH CARDINALITY
MISSING

Distinct count61
Unique (%)0.3%
Missing13370
Missing (%)36.8%
Memory size38.4 KiB
Value | Count | Frequency (%)
Chaoyang | 6949 | 19.2%
Dongcheng | 2039 | 5.6%
Fengtai | 1839 | 5.1%
Haidian | 1755 | 4.8%
Xicheng | 1177 | 3.2%
Sanlitun | 704 | 1.9%
Jinsong/Panjiayuan | 575 | 1.6%
Wangjing | 432 | 1.2%
Shilipu | 421 | 1.2%
Chongwenmen | 397 | 1.1%
Other values (51) | 6625 | 18.3%
(Missing) | 13370 | 36.8%

Length

Max length36
Median length7
Mean length6.85078
Min length3

neighbourhood_cleansed
Categorical

Distinct count16
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size36.2 KiB
Value | Count | Frequency (%)
朝阳区 / Chaoyang | 11583 | 31.9%
东城区 | 3656 | 10.1%
海淀区 | 3209 | 8.8%
丰台区 / Fengtai | 2434 | 6.7%
延庆县 / Yanqing | 2217 | 6.1%
密云县 / Miyun | 1919 | 5.3%
西城区 | 1871 | 5.2%
通州区 / Tongzhou | 1632 | 4.5%
怀柔区 / Huairou | 1617 | 4.5%
昌平区 | 1576 | 4.3%
Other values (6) | 4569 | 12.6%

Length

Max length16
Median length13
Mean length10.0023
Min length3

zipcode
Categorical

HIGH CARDINALITY
MISSING

Distinct count695
Unique (%)4.3%
Missing20158
Missing (%)55.6%
Memory size96.3 KiB
Value | Count | Frequency (%)
100000 | 1123 | 3.1%
100001 | 777 | 2.1%
100096 | 759 | 2.1%
100000 | 739 | 2.0%
38294 | 676 | 1.9%
100022 | 543 | 1.5%
100009 | 422 | 1.2%
100025 | 334 | 0.9%
101505 | 306 | 0.8%
100176 | 272 | 0.7%
Other values (685) | 10174 | 28.0%
(Missing) | 20158 | 55.6%

Length

Max length10
Median length3
Mean length4.5981
Min length1

property_type
Categorical

Distinct count45
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size37.0 KiB
Value | Count | Frequency (%)
Apartment | 14428 | 39.8%
Condominium | 4761 | 13.1%
House | 4129 | 11.4%
Loft | 2960 | 8.2%
Serviced apartment | 2189 | 6.0%
Farm stay | 1330 | 3.7%
Villa | 1222 | 3.4%
Bungalow | 985 | 2.7%
Cottage | 596 | 1.6%
Townhouse | 513 | 1.4%
Other values (35) | 3170 | 8.7%

Length

Max length22
Median length9
Mean length8.87807
Min length3

room_type
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size35.5 KiB
Value | Count | Frequency (%)
Entire home/apt | 22172 | 61.1%
Private room | 12444 | 34.3%
Shared room | 1667 | 4.6%

Length

Max length15
Median length15
Mean length13.7873
Min length11

accommodates
Real number (ℝ≥0)

Distinct count17
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.74211
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size283.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q34
95-th percentile10
Maximum18
Range17
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.0897
Coefficient of variation (CV)0.825656
Kurtosis5.70825
Mean3.74211
Median Absolute Deviation (MAD)1
Skewness2.33703
Sum135775
Variance9.54622
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%) 
21630144.9%
 
4650617.9%
 
330018.3%
 
127437.6%
 
623476.5%
 
513543.7%
 
89492.6%
 
169062.5%
 
106271.7%
 
74871.3%
 
Other values (7)10622.9%
 
ValueCountFrequency (%) 
127437.6%
 
21630144.9%
 
330018.3%
 
4650617.9%
 
513543.7%
 
ValueCountFrequency (%) 
181< 0.1%
 
169062.5%
 
151310.4%
 
141470.4%
 
13830.2%
 

bathrooms
Real number (ℝ≥0)

Distinct count43
Unique (%)0.1%
Missing21
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1.42426
Minimum0
Maximum101.5
Zeros170
Zeros (%)0.5%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q31.5
95-th percentile3
Maximum101.5
Range101.5
Interquartile range (IQR)0.5

Descriptive statistics

Standard deviation1.37529
Coefficient of variation (CV)0.96562
Kurtosis1038.81
Mean1.42426
Median Absolute Deviation (MAD)0
Skewness19.0739
Sum51646.5
Variance1.89143
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%) 
12507569.1%
 
1.5453012.5%
 
232068.8%
 
38412.3%
 
0.55181.4%
 
44711.3%
 
2.52750.8%
 
52640.7%
 
62010.6%
 
01700.5%
 
Other values (33)7112.0%
 
ValueCountFrequency (%) 
01700.5%
 
0.55181.4%
 
12507569.1%
 
1.5453012.5%
 
232068.8%
 
ValueCountFrequency (%) 
101.51< 0.1%
 
751< 0.1%
 
331< 0.1%
 
311< 0.1%
 
252< 0.1%
 

bedrooms
Real number (ℝ≥0)

Distinct count26
Unique (%)0.1%
Missing142
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean1.66307
Minimum0
Maximum50
Zeros140
Zeros (%)0.4%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile4
Maximum50
Range50
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.48003
Coefficient of variation (CV)0.889938
Kurtosis90.4273
Mean1.66307
Median Absolute Deviation (MAD)0
Skewness5.79604
Sum60105
Variance2.19049
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%) 
12423466.8%
 
2678318.7%
 
323436.5%
 
49852.7%
 
55891.6%
 
64071.1%
 
72250.6%
 
81420.4%
 
01400.4%
 
101320.4%
 
Other values (16)1610.4%
 
(Missing)1420.4%
 
ValueCountFrequency (%) 
01400.4%
 
12423466.8%
 
2678318.7%
 
323436.5%
 
49852.7%
 
ValueCountFrequency (%) 
502< 0.1%
 
311< 0.1%
 
252< 0.1%
 
241< 0.1%
 
232< 0.1%
 

beds
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count46
Unique (%)0.1%
Missing380
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean2.24226
Minimum0
Maximum115
Zeros630
Zeros (%)1.7%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile6
Maximum115
Range115
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.75371
Coefficient of variation (CV)1.2281
Kurtosis138.979
Mean2.24226
Median Absolute Deviation (MAD)1
Skewness7.72516
Sum80504
Variance7.58294
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%) 
11737247.9%
 
2963926.6%
 
335379.7%
 
418405.1%
 
57722.1%
 
06301.7%
 
65441.5%
 
83040.8%
 
72670.7%
 
161670.5%
 
Other values (36)8312.3%
 
(Missing)3801.0%
 
ValueCountFrequency (%) 
06301.7%
 
11737247.9%
 
2963926.6%
 
335379.7%
 
418405.1%
 
ValueCountFrequency (%) 
1151< 0.1%
 
701< 0.1%
 
503< 0.1%
 
471< 0.1%
 
457< 0.1%
 

amenities
Categorical

HIGH CARDINALITY

Distinct count28222
Unique (%)77.8%
Missing0
Missing (%)0.0%
Memory size1.5 MiB
ValueCountFrequency (%) 
{}820.2%
 
{TV,Wifi,"Air conditioning",Kitchen,"Smoking allowed",Elevator,Heating,Washer,"Smoke alarm","Carbon monoxide alarm","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Private entrance"}770.2%
 
{TV,Wifi,"Smoking allowed","Pets allowed","Family/kid friendly","Suitable for events",Washer,"Fire extinguisher",Essentials,Shampoo,"Hair dryer","Laptop-friendly workspace"}710.2%
 
{TV,Wifi,"Air conditioning",Kitchen,"Smoking allowed","Pets allowed",Elevator,Heating,Washer,"Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer","Private entrance"}670.2%
 
{TV,Wifi,"Air conditioning",Elevator,Heating,Essentials,Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Private entrance"}530.1%
 
{TV,Wifi,"Air conditioning",Kitchen,Elevator,Heating,Washer,Essentials,Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Hot water"}470.1%
 
{TV,Wifi,"Air conditioning",Kitchen,Elevator,Heating,Washer,"Smoke alarm",Essentials,Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Hot water"}410.1%
 
{TV,Wifi,"Air conditioning",Kitchen,Elevator,Heating,Washer,Essentials,Shampoo,Hangers,"Hair dryer","Hot water"}410.1%
 
{Wifi,"Air conditioning",Kitchen,Gym,Elevator,Heating,"Suitable for events",Washer,"Smoke alarm","First aid kit","Fire extinguisher",Hangers,"Laptop-friendly workspace","Hot water"}400.1%
 
{Wifi,"Air conditioning",Elevator,Heating,Washer,Essentials,Shampoo,"Lock on bedroom door","Hair dryer","Laptop-friendly workspace"}380.1%
 
Other values (28212)3572698.5%
 

Length

Max length1917
Median length261
Mean length305.847
Min length2

square_feet
Real number (ℝ≥0)

MISSING

Distinct count9
Unique (%)64.3%
Missing36269
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean607.571
Minimum0
Maximum4306
Zeros5
Zeros (%)< 0.1%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median215.5
Q3603
95-th percentile2458.7
Maximum4306
Range4306
Interquartile range (IQR)603

Descriptive statistics

Standard deviation1141.87
Coefficient of variation (CV)1.8794
Kurtosis9.72691
Mean607.571
Median Absolute Deviation (MAD)215.5
Skewness2.99983
Sum8506
Variance1.30386e+06
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%) 
05< 0.1%
 
6032< 0.1%
 
3771< 0.1%
 
551< 0.1%
 
6671< 0.1%
 
1081< 0.1%
 
14641< 0.1%
 
43061< 0.1%
 
3231< 0.1%
 
(Missing)36269> 99.9%
 
ValueCountFrequency (%) 
05< 0.1%
 
551< 0.1%
 
1081< 0.1%
 
3231< 0.1%
 
3771< 0.1%
 
ValueCountFrequency (%) 
43061< 0.1%
 
14641< 0.1%
 
6671< 0.1%
 
6032< 0.1%
 
3771< 0.1%
 

price
Real number (ℝ≥0)

SKEWED

Distinct count1007
Unique (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean726.046
Minimum0
Maximum70723
Zeros12
Zeros (%)< 0.1%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile120
Q1255
median396
Q3651
95-th percentile2300
Maximum70723
Range70723
Interquartile range (IQR)396

Descriptive statistics

Standard deviation1861.04
Coefficient of variation (CV)2.56325
Kurtosis695.601
Mean726.046
Median Absolute Deviation (MAD)177
Skewness21.9409
Sum2.63431e+07
Variance3.46347e+06
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
29713703.8%
 
19811353.1%
 
39610953.0%
 
2697562.1%
 
2907011.9%
 
3686621.8%
 
3616141.7%
 
3895791.6%
 
1495481.5%
 
995101.4%
 
Other values (997)2831378.0%
 
ValueCountFrequency (%) 
012< 0.1%
 
281< 0.1%
 
421< 0.1%
 
573< 0.1%
 
64300.1%
 
ValueCountFrequency (%) 
707231< 0.1%
 
706561< 0.1%
 
700021< 0.1%
 
695991< 0.1%
 
689971< 0.1%
 

weekly_price
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count191
Unique (%)62.0%
Missing35975
Missing (%)99.2%
Infinite0
Infinite (%)0.0%
Mean2771.67
Minimum150
Maximum42000
Zeros0
Zeros (%)0.0%
Memory size283.5 KiB

Quantile statistics

Minimum150
5-th percentile485
Q11000
median1765
Q33000
95-th percentile7974.45
Maximum42000
Range41850
Interquartile range (IQR)2000

Descriptive statistics

Standard deviation3859.29
Coefficient of variation (CV)1.39241
Kurtosis44.3421
Mean2771.67
Median Absolute Deviation (MAD)976
Skewness5.69568
Sum853673
Variance1.48942e+07
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
10009< 0.1%
 
30008< 0.1%
 
16008< 0.1%
 
15008< 0.1%
 
12008< 0.1%
 
8007< 0.1%
 
20006< 0.1%
 
40005< 0.1%
 
4855< 0.1%
 
5605< 0.1%
 
Other values (181)2390.7%
 
(Missing)3597599.2%
 
ValueCountFrequency (%) 
1501< 0.1%
 
1601< 0.1%
 
2401< 0.1%
 
2551< 0.1%
 
2801< 0.1%
 
ValueCountFrequency (%) 
420001< 0.1%
 
272491< 0.1%
 
240001< 0.1%
 
201561< 0.1%
 
189121< 0.1%
 

monthly_price
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count185
Unique (%)58.5%
Missing35967
Missing (%)99.1%
Infinite0
Infinite (%)0.0%
Mean10359.7
Minimum400
Maximum150000
Zeros0
Zeros (%)0.0%
Memory size283.5 KiB

Quantile statistics

Minimum400
5-th percentile1600
Q13275.75
median6400
Q312097.5
95-th percentile30000
Maximum150000
Range149600
Interquartile range (IQR)8821.75

Descriptive statistics

Standard deviation13553.9
Coefficient of variation (CV)1.30833
Kurtosis40.9218
Mean10359.7
Median Absolute Deviation (MAD)3800
Skewness5.17697
Sum3.27367e+06
Variance1.83708e+08
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
600010< 0.1%
 
21008< 0.1%
 
40008< 0.1%
 
65007< 0.1%
 
180007< 0.1%
 
80006< 0.1%
 
29005< 0.1%
 
120005< 0.1%
 
30005< 0.1%
 
110005< 0.1%
 
Other values (175)2500.7%
 
(Missing)3596799.1%
 
ValueCountFrequency (%) 
4001< 0.1%
 
5301< 0.1%
 
6001< 0.1%
 
6251< 0.1%
 
8241< 0.1%
 
ValueCountFrequency (%) 
1500001< 0.1%
 
849321< 0.1%
 
782051< 0.1%
 
600003< 0.1%
 
500151< 0.1%
 

security_deposit
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count301
Unique (%)2.4%
Missing23793
Missing (%)65.6%
Infinite0
Infinite (%)0.0%
Mean655.045
Minimum0
Maximum35362
Zeros4281
Zeros (%)11.8%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median200
Q3700
95-th percentile2000
Maximum35362
Range35362
Interquartile range (IQR)700

Descriptive statistics

Standard deviation2337.31
Coefficient of variation (CV)3.56816
Kurtosis139.773
Mean655.045
Median Absolute Deviation (MAD)200
Skewness11.0603
Sum8.18151e+06
Variance5.463e+06
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0428111.8%
 
20013403.7%
 
50010342.8%
 
100010072.8%
 
3008872.4%
 
7006301.7%
 
1005581.5%
 
8005261.4%
 
20002720.7%
 
6001680.5%
 
Other values (291)17874.9%
 
(Missing)2379365.6%
 
ValueCountFrequency (%) 
0428111.8%
 
52< 0.1%
 
106< 0.1%
 
208< 0.1%
 
251< 0.1%
 
ValueCountFrequency (%) 
353621< 0.1%
 
353191< 0.1%
 
350003< 0.1%
 
347362< 0.1%
 
343432< 0.1%
 

cleaning_fee
Real number (ℝ≥0)

MISSING
SKEWED
ZEROS

Distinct count186
Unique (%)1.4%
Missing23123
Missing (%)63.7%
Infinite0
Infinite (%)0.0%
Mean60.9431
Minimum0
Maximum10000
Zeros4361
Zeros (%)12.0%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median40
Q370
95-th percentile150
Maximum10000
Range10000
Interquartile range (IQR)70

Descriptive statistics

Standard deviation218.669
Coefficient of variation (CV)3.58808
Kurtosis546.789
Mean60.9431
Median Absolute Deviation (MAD)40
Skewness20.1208
Sum802011
Variance47816.1
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0436112.0%
 
5015714.3%
 
1007302.0%
 
806691.8%
 
606481.8%
 
404591.3%
 
354461.2%
 
303681.0%
 
702950.8%
 
202780.8%
 
Other values (176)33359.2%
 
(Missing)2312363.7%
 
ValueCountFrequency (%) 
0436112.0%
 
15< 0.1%
 
25< 0.1%
 
35< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
100001< 0.1%
 
41571< 0.1%
 
41382< 0.1%
 
4000220.1%
 
33441< 0.1%
 

guests_included
Real number (ℝ≥0)

Distinct count16
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.36499
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Memory size283.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile4
Maximum16
Range15
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.25698
Coefficient of variation (CV)0.920869
Kurtosis47.1697
Mean1.36499
Median Absolute Deviation (MAD)0
Skewness5.89996
Sum49526
Variance1.58
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
13087385.1%
 
228868.0%
 
411763.2%
 
34271.2%
 
63481.0%
 
52050.6%
 
81210.3%
 
10700.2%
 
16550.2%
 
7410.1%
 
Other values (6)810.2%
 
ValueCountFrequency (%) 
13087385.1%
 
228868.0%
 
34271.2%
 
411763.2%
 
52050.6%
 
ValueCountFrequency (%) 
16550.2%
 
155< 0.1%
 
1411< 0.1%
 
134< 0.1%
 
12360.1%
 

extra_people
Real number (ℝ≥0)

ZEROS

Distinct count190
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.4737
Minimum0
Maximum2118
Zeros29249
Zeros (%)80.6%
Memory size283.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile100
Maximum2118
Range2118
Interquartile range (IQR)0

Descriptive statistics

Standard deviation79.1011
Coefficient of variation (CV)3.86355
Kurtosis302.14
Mean20.4737
Median Absolute Deviation (MAD)0
Skewness14.3015
Sum742846
Variance6256.98
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
02924980.6%
 
5017194.7%
 
10016504.5%
 
805761.6%
 
602990.8%
 
2002910.8%
 
1502850.8%
 
352140.6%
 
401670.5%
 
301230.3%
 
Other values (180)17104.7%
 
ValueCountFrequency (%) 
02924980.6%
 
14< 0.1%
 
31< 0.1%
 
51< 0.1%
 
82< 0.1%
 
ValueCountFrequency (%) 
21181< 0.1%
 
20841< 0.1%
 
20641< 0.1%
 
200016< 0.1%
 
19952< 0.1%
 

minimum_nights
Real number (ℝ≥0)

SKEWED

Distinct count66
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.30758
Minimum1
Maximum1086
Zeros0
Zeros (%)0.0%
Memory size283.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile14
Maximum1086
Range1085
Interquartile range (IQR)0

Descriptive statistics

Standard deviation28.3065
Coefficient of variation (CV)6.57133
Kurtosis672.164
Mean4.30758
Median Absolute Deviation (MAD)0
Skewness22.4918
Sum156292
Variance801.26
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
13021683.3%
 
221786.0%
 
310242.8%
 
308192.3%
 
73691.0%
 
53681.0%
 
153160.9%
 
901750.5%
 
101610.4%
 
60890.2%
 
Other values (56)5681.6%
 
ValueCountFrequency (%) 
13021683.3%
 
221786.0%
 
310242.8%
 
4790.2%
 
53681.0%
 
ValueCountFrequency (%) 
10861< 0.1%
 
10006< 0.1%
 
9997< 0.1%
 
6002< 0.1%
 
5001< 0.1%
 

maximum_nights
Real number (ℝ≥0)

SKEWED

Distinct count167
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28487.8
Minimum1
Maximum1e+09
Zeros0
Zeros (%)0.0%
Memory size283.5 KiB

Quantile statistics

Minimum1
5-th percentile30
Q11125
median1125
Q31125
95-th percentile1125
Maximum1e+09
Range1e+09
Interquartile range (IQR)0

Descriptive statistics

Standard deviation5.24986e+06
Coefficient of variation (CV)184.284
Kurtosis36283
Mean28487.8
Median Absolute Deviation (MAD)0
Skewness190.481
Sum1.03362e+09
Variance2.75611e+13
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
11252853178.6%
 
36518665.1%
 
3016084.4%
 
905781.6%
 
604061.1%
 
73841.1%
 
1002120.6%
 
311940.5%
 
151890.5%
 
1801870.5%
 
Other values (157)21285.9%
 
ValueCountFrequency (%) 
1480.1%
 
2370.1%
 
3780.2%
 
4300.1%
 
51300.4%
 
ValueCountFrequency (%) 
1e+091< 0.1%
 
500001< 0.1%
 
99991< 0.1%
 
18251< 0.1%
 
14601< 0.1%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
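
A minimal sketch of this calculation in plain Python follows. The function name `pearson_r` and the price lists are hypothetical illustrations, not part of the report or its data. It also checks the invariance property by shifting and rescaling one variable by a positive factor.

```python
import math

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return cov / (std_x * std_y)

# Hypothetical prices, not taken from the dataset.
nightly = [300.0, 450.0, 600.0, 900.0]
weekly = [1900.0, 2800.0, 4100.0, 5600.0]

r = pearson_r(nightly, weekly)
# Shifting and rescaling one variable (positive factor) leaves r unchanged.
r_rescaled = pearson_r([2 * x + 50 for x in nightly], weekly)
print(round(r, 4), round(r_rescaled, 4))
```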

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
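
The rank-based calculation can be sketched as follows: replace each value by its rank, then apply the covariance-over-standard-deviations formula to the ranks. Giving tied values their average rank is a common convention and is assumed here. All function names and the sample data are hypothetical.

```python
import math

def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs) / n)
    sy = math.sqrt(sum((y - my) ** 2 for y in ys) / n)
    return cov / (sx * sy)

def spearman_rho(xs, ys):
    """Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship (y = x squared).
xs = [1, 2, 3, 4, 5]
ys = [x ** 2 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

On this sample ρ reaches 1 because the relationship is perfectly monotonic, while r stays below 1 because it is not linear.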

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
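
The pair-counting definition can be sketched directly. This is the simple variant, often called tau-a, in which tied pairs count as neither concordant nor discordant. Statistical libraries commonly use a tie-corrected variant, so results can differ when the data contain ties. The function name and the sample data are hypothetical.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs; ties count as neither."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # The product is positive when both variables move in the same direction.
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Hypothetical data: 6 pairs, of which one (the middle two points) is discordant.
xs = [1, 2, 3, 4]
ys = [1, 3, 2, 4]
tau = kendall_tau_a(xs, ys)
print(tau)  # (5 - 1) / 6
```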

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. The report uses a bias-corrected measure proposed by Bergsma in 2013.
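
The following sketch shows one way to compute a bias-corrected Cramér's V from a contingency table, using the correction as it is commonly stated for Bergsma's estimator. It is an illustration under that assumption, not the report's own code, and the function name and tables are hypothetical.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Correction terms: shrink phi squared and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

# Hypothetical 2x2 tables: perfect association and exact independence.
perfect = [[10, 0], [0, 10]]
independent = [[5, 5], [5, 5]]
print(cramers_v_corrected(perfect), cramers_v_corrected(independent))
```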

Missing values

Sample

First rows

idnameneighbourhoodneighbourhood_cleansedzipcodeproperty_typeroom_typeaccommodatesbathroomsbedroomsbedsamenitiessquare_feetpriceweekly_pricemonthly_pricesecurity_depositcleaning_feeguests_includedextra_peopleminimum_nightsmaximum_nights
044054Modern and Comfortable Living in CBDChaoyang朝阳区 / Chaoyang100022Serviced apartmentEntire home/apt92.03.04.0{TV,"Cable TV",Internet,Wifi,"Air conditioning","Wheelchair accessible",Kitchen,"Pets allowed",Elevator,"Free street parking","Buzzer/wireless intercom",Heating,"Family/kid friendly",Washer,"Smoke alarm","Carbon monoxide alarm","Safety card","Fire extinguisher",Essentials,Shampoo,"24-hour check-in",Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Self check-in","Smart lock","Children’s books and toys",Crib,"Hot water","Luggage dropoff allowed","Long term stays allowed","Paid parking on premises"}1464.0835.08373.027603.0708.071.0671.02365
1100213The Great Wall Box Deluxe Suite A团园长城小院东院套房NaN密云县 / Miyun101508Guest suitePrivate room21.01.01.0{Internet,Wifi,"Air conditioning","Free parking on premises",Breakfast,"Pets live on this property",Cat(s),"Indoor fireplace",Heating,"Family/kid friendly",Washer,Dryer,Essentials,Shampoo,Hangers,"Hair dryer","Private living room","Private entrance","Bed linens","Extra pillows and blankets","Garden or backyard"}NaN1203.07200.028800.00.00.010.0130
2114384CBD Luxury 1-bedroom suite with a 30m2 terraceITC朝阳区 / ChaoyangNaNApartmentEntire home/apt21.01.01.0{TV,"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,"Smoking allowed",Doorman,Elevator,Heating,"Family/kid friendly",Washer,"Smoke alarm","Carbon monoxide alarm","Fire extinguisher",Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Hot water","Bed linens","Ethernet connection","Shower gel"}NaN602.04200.0NaNNaNNaN10.01730
3114465CBD Spacious Luxury Suite with 30 sqm terraceChaoyang朝阳区 / Chaoyang100022ApartmentEntire home/apt21.01.01.0{TV,"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,"Smoking allowed",Doorman,Elevator,Heating,"Family/kid friendly",Washer,"Smoke alarm","Carbon monoxide alarm","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Hot water","Bed linens","Ethernet connection","Shower gel"}NaN602.04760.0NaNNaN30.010.011125
4128496Heart of Beijing: House with View 2Dong Si东城区NaNHouseEntire home/apt31.01.02.0{TV,Internet,Wifi,"Air conditioning",Doorman,Breakfast,Heating,"Family/kid friendly",Washer,Essentials,Shampoo,"Lock on bedroom door",Hangers,"Hair dryer",Iron,"Private entrance","Hot water","Bed linens","Extra pillows and blankets",Microwave,Refrigerator,"Dishes and silverware","Single level home","Patio or balcony","Garden or backyard","Luggage dropoff allowed","Long term stays allowed"}323.0411.03185.0NaN0.071.02106.03365
5161902cozy studio in center of BeijingDongcheng东城区100027ApartmentEntire home/apt21.01.01.0{TV,"Cable TV",Internet,Wifi,"Air conditioning","Wheelchair accessible",Kitchen,"Pets allowed",Doorman,Elevator,Heating,"Family/kid friendly",Washer,Dryer,Essentials,Shampoo,"24-hour check-in",Hangers,"Hair dryer",Iron,"Laptop-friendly workspace"}NaN552.0490.01600.00.00.010.01365
6162144nice studio near subway, sleep 4Dongcheng朝阳区 / Chaoyang100027ApartmentEntire home/apt41.01.02.0{TV,"Cable TV",Internet,Wifi,"Air conditioning","Wheelchair accessible",Kitchen,"Smoking allowed","Pets allowed",Doorman,Gym,Elevator,Heating,"Family/kid friendly",Washer,Dryer}0.0601.0560.01600.00.00.010.01365
7279078Nice Apartment in BeijingDongcheng东城区100027ApartmentEntire home/apt21.01.01.0{TV,"Cable TV",Internet,Wifi,"Air conditioning","Wheelchair accessible",Kitchen,"Smoking allowed","Pets allowed",Doorman,Elevator,Heating,"Family/kid friendly",Washer,Dryer,Essentials,Shampoo}NaN403.0445.01750.0700.00.0264.01365
8282825Large 2 BR Apt in a great locationSanlitun朝阳区 / Chaoyang100027ApartmentEntire home/apt42.02.03.0{TV,"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,Doorman,Elevator,"Free street parking",Heating,"Family/kid friendly",Washer,Dryer,"Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Self check-in","Building staff","High chair","Hot water",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove}NaN743.04884.0NaN0.0283.020.0390
9287026Studio in downtown Beijing #2Dongcheng朝阳区 / Chaoyang100027ApartmentEntire home/apt31.01.01.0{TV,"Cable TV",Internet,Wifi,"Air conditioning","Wheelchair accessible",Kitchen,"Smoking allowed",Doorman,Elevator,Heating,Washer,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace"}NaN418.0430.01500.01000.035.0280.01365

Last rows

idnameneighbourhoodneighbourhood_cleansedzipcodeproperty_typeroom_typeaccommodatesbathroomsbedroomsbedsamenitiessquare_feetpriceweekly_pricemonthly_pricesecurity_depositcleaning_feeguests_includedextra_peopleminimum_nightsmaximum_nights
3627343821919Danny’ s apartmentNaN通州区 / TongzhouNaNCondominiumEntire home/apt21.01.01.0{TV,Wifi,"Air conditioning",Kitchen,Elevator,"Hot tub",Heating,Washer,"Smoke alarm",Essentials,Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Hot water"}NaN283.0NaNNaNNaNNaN10.016
3627443822638中关村科学院东南小区带电梯南北通透户型Haidian海淀区NaNCondominiumEntire home/apt21.02.02.0{TV,Wifi,"Air conditioning",Kitchen,"Pets allowed",Elevator,Heating,Washer,Hangers,"Hair dryer","Laptop-friendly workspace","Hot water"}NaN283.0NaNNaNNaNNaN10.030365
3627543823221京韵邻里·记忆大床房站牛街小吃街旁近法源寺南锣鼓巷天坛后海康有为故居谭嗣同故居北京站南站西站Xicheng西城区NaNBoutique hotelPrivate room21.01.01.0{TV,Wifi,"Air conditioning",Heating,"Suitable for events","Smoke alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door","Hair dryer","Laptop-friendly workspace"}NaN983.0NaNNaNNaNNaN10.011125
3627643825036【路客】初夏特惠|同仁医院| 崇文门地铁站|北京站·天安门·王府井|温馨·三室一厅一卫Dongcheng东城区NaNHouseEntire home/apt61.03.03.0{TV,Wifi,"Air conditioning",Kitchen,Elevator,Heating,Washer,"Smoke alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer","Self check-in",Lockbox,"Hot water","Ethernet connection",Refrigerator,"Dishes and silverware","Cooking basics"}NaN899.0NaNNaN0.0150.060.0130
3627743836478loft大床房Dongcheng东城区NaNBoutique hotelPrivate room21.01.02.0{TV,Wifi,"Air conditioning",Heating,"Suitable for events","Fire extinguisher",Shampoo,Hangers,"Hair dryer","Hot water"}NaN396.0NaNNaNNaNNaN10.011125
3627843836486墅境洋房身临其境NaN大兴区 / DaxingNaNVillaEntire home/apt54.05.010.0{TV,Wifi,"Air conditioning",Kitchen,"Free parking on premises","Smoking allowed","Pets allowed",Gym,"Hot tub",Heating,"Suitable for events",Washer,"Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Hot water"}NaN1499.0NaNNaNNaNNaN10.01365
3627943836628loft家庭房Dong Si东城区NaNBoutique hotelPrivate room41.01.02.0{TV,Wifi,"Air conditioning",Heating,"Suitable for events","Fire extinguisher",Shampoo,Hangers,"Hair dryer","Hot water"}NaN503.0NaNNaNNaNNaN10.011125
3628043836721豪华间大床房Dong Si东城区NaNBoutique hotelPrivate room21.01.01.0{TV,Wifi,"Air conditioning",Heating,"Suitable for events","Fire extinguisher",Shampoo,Hangers,"Hair dryer","Hot water"}NaN701.0NaNNaNNaNNaN10.011125
3628143836771落地窗大床房Dongcheng东城区NaNBoutique hotelPrivate room21.01.01.0{TV,Wifi,"Air conditioning",Heating,"Suitable for events","Fire extinguisher",Shampoo,Hangers,"Hair dryer","Hot water"}NaN701.0NaNNaNNaNNaN10.011125
3628243836973中关村新东方微软大厦总部北京大学人民大学新中关字节跳动西屋国际公寓月租短租Zhongguancun海淀区NaNHostelPrivate room21.05.05.0{Wifi,"Air conditioning","Smoking allowed","Pets allowed",Elevator,Heating,Washer,"Smoke alarm","Fire extinguisher",Shampoo,Hangers,"Hair dryer","Laptop-friendly workspace","Hot water"}NaN191.0NaNNaNNaNNaN10.011125